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2 INTRODUCT LUN ? 


WISE is an online interactive information retrieval system designed to 
operate on keyword-oriented databases. The system was designed particularly 
with bibliographic databases in mind, but is potentially useful for any . 
keyword-oriented database. The WISE user searches a database by entering 
and combining keywords that are appropriate to the database and-the subject 
area of interest. The user can also enter cerfain command statements to 
refine the results of the search, and to, control the generation of output. 
In addition, certain features of WISE make it very convenient to perform y 


SDI (Selective Dissemination of Informdtion) applications. 


The WISE system as a wale consists of five absolute programs and two “ 
. sets pt subroutines for creating, updating, and searching a database. The 
MACC document entitled wISk Seareh Program desoribes how to search’ an 
already existing-database. This document deals With the subroutines, and 
programs used to initially create and then update ,an orfline database. 


eo 


The reader is assumed to be familiar with fme WISE SEARCH Program, the 
FORTRAN V programming language implemented for Univac 1100 series compiters 


a 
FORTRAN subroutines, and simple EXEC-8 runstreams. 


‘ 
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CHARACTERISTICS OF THE ORIGINAL INFORMATION SOURCE ‘ ! 


° 


\ 
WISE databages can be created from a large variety of information sources. 


This document assumes that the original information source has the following 
properties. ¥ ; _ r 


1) lt exists on some computer-readable medium such as tapes, drum file, 


° 
cards, etc. , 


2) It is logically composed of a set of information records each of 


which contains: 


p) a unique ‘identification number, henceforth called a citation , 
number; 


b) a set of fields which are to be used as retrieval keywords; 


;c) other fields, some ‘or all of which may pe displayed online. 


3) It will be subject to periodic updates (addition ofgnew records, not 


the modification or deletion of existing records). 


The following is an example of an information record from the ERIC, RIE 
information source (Educational Resources Information Center, Resources 


in Education). It is typical of the type of record which WISE was designed 


to process. 
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ERIC M exsion Number —icenefica |. ; Legislative Authority Code for identi 
Won ngber sequenteally assigned to 4 SAMPLE ENTRY yng the Mepuistinabich Supported 
Gcuren'’ as they ate processed the research activity (when appli: 


: . ; cable) * ¢ 
‘ Author(s) ——_" ep ssesan 56 AA 123 456 ———~— Clearinghouse accession num! 
° 


Smith John D dohnason Jane 
Tithe ————*q-——— Career Education fur Women 
Central Uni Chicago. I — age fespon- 
Spons Agency - National Inst of Education + Sponsoring Agency — agency resp 


Orgatsatran where document orggin (DHEW) Washington. DC sible for initiating, funding. and 
ated Report No- CL -2081 § managing the research project. 
: gf — Pb Date May 79 a a 
Date published —— Contract NEC 00-2 0099 ‘ Report Number — assigned by orig 
y Note~ [29p . Presented at the Nanonal Confer 


Contract of Grant Number~ ¢cntract— 
numbers have NEC prefixes. grant 
numbers have NE G prefues 


inator 
ene on Career Education (3rd. Chicago. Il. 
May 1$ 17, 1973) Mates T ‘ 
Available from - Campus Bookstore, 123 College 7 Descriptive Note 
Ave Chicago. I 60690 ($3 25) - 
a. PRICE ME$065 HC-$6.58 < : 
Descriptors —*Career Opportunities, Career —————_———_ Descriptors — 


subject terms = which 
Planping, Careers, *Demand Occupations. characterize substantive content 
“Employment Opportunities, Females. Labor Only the mayor terms, preceded by 
Force, Labor Market, "Manpower Needs, Oc- 


Alternate soprce tor oblaiaung doc 
ments 


, rinted inthe sub 
EDRS Price price through ERIC Doc cupatronal Agpiration. Occupational Guidance. ci nag ae spriqted’ 
ment R = oa ti ee Me: Occupations, Vocational Counseling. * Working % b 

en eproduction Service Women 


means microfiche, "HC" means hard 


Identihers - Consortium of States, *Navonal Oc ——_____ 
copy When listed “not available trom 


= | ntifying © 
cupational Competency Testing Institute, Identifiers — additional dentifyng 


EDRS™ other sources are cited above Minow . emcee sects ws tne These 
‘ Women's opportunities for employment will be 0 escriplors. 
. direcly related to their level of skill and ex- . 
perience but also to the labor market demands 


through the remainder of the decade The num- 
ber of workers needed for all mayor occupational 
Categories is expected to increase by about one- 
fifth between 1970 and 1980, but the growth rate 
will vary by occupational group Profgssional and 
technical workers are expected to have the high a. 
est predicted rate (39 percent), followed by service 
S workers (38 percent), clencal workers (26 per- 
cent), sales workers (24 percent), crafismen and 
foremen (20 percent), managers and administra. 
tors (18 percent), and operatives (1) percent) This 
puNication contains a bref discussion and em- 
ployment information concerning occupations for 
professional and technical workers, managers arid ® ¥ 
administrators, shilled trades, sales workers, cler 
real workers, and service workers In order for 
women to take advantage of increased labor 
market demands, employer attitudes toward work: 
b ing women: need to change and women must * 
(1) receive better career planning and counseling, 
(2) change therr career aspirations, and (3) 
fully utsize the sources of legal protection and as 


sistance which are available tO them (SB) > ———— Abstractor’s initials 


—_—_——— Informative Abstract 


Before creating a database, the user should answer the following 


questions: 


1) Which fields are to be used-as retrieval keywords? In the example 


above, natural choices for retrieval keywords would be descriptors, 
ident ffiers, and perhaps author names. 
® 
12) If several sebeehine fields are to be retrieval keywords, is there any 
spelling overlap ‘among them? For example, suppose that author last | ‘ 
names and descriptors were chosen to be retrieval keywords. Would it 
be likely that the spelling of many (any) author last names would be 


the same as the spelling of descriptors? -If so, author last ®ames 
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‘ : 7 ‘ 
ty, ’ a om 
‘could be defined to be used as auxiliary keywords (see Section 3.1.2). 
bis would resolve the ambiguity. 


Which fields should be gtouued ‘0 mafn keywords? For example, if 

there were no spelling overlap between author last names and descriptors, 
both \of these fields could be grouped together as main keywords. (Main 
keywords dre described in the WISE: SEARCH program document.) 


4) Which (if any) of the fields should be treated as auxiliary keywords. 


When doing a search with the SEARCH program, auxiliary keywords are 


specified by using a special function name. Special functions are 


described in the WISE SEARCH program document. 


5) 


Precisely what textual information will be available online? \ 
6) 


In what format should this textual informatich be displayed? 


Items 5) and 6) refer to what is called additional information (Al) in the 
WISE SEARCH program document. 


The following is an example of one possible Al format. 


(Note: the information was taken from record 98271 and not record 654321 


which is shown above.) 
ED 98271 * 
SMITH, KATHLEEN, ED. 


DESEGREGATION/INTEGRATION: PLANNING FOR SCHOOL CHANGE. A TRAINING 
PROGRAM FOR INTERGROUP EDUCATORS. 
7TH / UD : o- : 


* 
The format we have chosen displays the following information. 


* citation number 
s 
* author(s) ‘ 


* title 


date of publication/the first two alphabetic characters fram the 
Clearinghouse accession number. 


Obviously, a wide variety of other formats is also possible. 


When all of the above questions have been answeredg initial creation of 
the database may proceed. 


3 z y 
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eo PROCESSING THE ORIGINAL INFORMATION SOURCE 


\ 
° \ 
Figure 1 depicts the overall process required to create an online WISE 
database. There are. three programs involved: XX-READ; MERGE, and BUILD. 
XX-READ requires the most explanation because when you are building a new 
database, you’must code the part of XX-READ that reads the original in- 
formation source. : : 


3.1 XX-READ ; 
\ ._  XX-READ is really a generic name which refers to a set of prograts, 
each of which is coded to receive input from a particular information 
source. For example, RIE-READ is a program which reads the ERIC’ RIE tapes. 
‘ 
CIJE-READ reads the ERIC CIJE tapes (Current Index to Journals in Educa- 


tion). You can name your own XX-READ program anything you like. 


Functionally, XX-READ must be capable of reading the input medium on 


which the original information is stored, isolating the specific data fields 


of interest within each logical record, and writing two different output 


tape files. These output files will be referred to'as the KEYWORD file and 


the Al (additional information) file. These output files are written in a 


- 


standard WISE internal format and once they are created, all other processes 


for database creation are performéd by the already-existing programs, 
MERGE and BULLD. 


Briefly, the KEYWORD file contains acai necessary to retrieve 
the citation numbers for those records containing a given keyword. The Al 
file contains the textual information which can be displayed online. \ 

‘A set of subroutines is available for writing the KEYWORD file 
and another set is available for writing the AI filé. The following 


sections describe how to use them. , ¥ 


: Ag «it 


é 
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MULTI-REEL 
_. AL 
TAPE FILE: 


. KEYWORD FILE 
(RECSIZ WORD RECORDS) 


FIRST SORTED 
KEYWORD 
‘TAPE 


LAST SORTED 
KEYWORD 
TAPE 


Ay records are 
assumed to be 
ordered by ascend- 
ing citation #. © 
2 2-WAY MERGE 
Use as many passes as 
necessary ‘to merge all tapes. 


MULTI-REEL 
SORTED KEYWORD 
TAPE FILE 


Distinct keywords which have 
the same hash address and 
residue are detected here. 


Actually builds the tata 
base and then dumps it to 

< tape. It doesn't do any 
* sorting. 


This program operates 
a differently for initial 

database creation than 
for updates. 


MULTI-REEL 
CITATION # 
RETRIEVAL 
TAPE FILE ~ 


MULTI-REEL 
AI RETRIEVAL 
TAPE FILE 


; e . 
’ 


Figure 1 + INITIAL ONLINE DATABASE CREATION \ 


10—iy. 4 


‘OLDB . 13-3] 
5/1976 . 


‘ 


bee 


@.. Writing the KEYWORD File : wt 


A set of three FORTRAN-callable subroutines is proVided for writing the 
* KEYWORD file. : ; 


KEYOPN (open the KEYWORD file) 
KEYOUT (output KEYWORD records) S 
KEYCLS (close the KEYWORD file) 


\ 
* dhe calling’ sequences are as follows: 


CALL cen (neve ews 12 tn RECS, EAD HEAL 
where: ‘ ; . ' a 


4 


KEYREC - the name of the FORTRAN array which will contain the keyword 


recards (see Section 3.1.2). 


KEYSIZ - the maximum length, in characters, of a keyword (suggested value 
a | 7 . 
is 60). 
; INLEN: - (KESS 12 + 5)/6 + 3. This 7 the size of the array KEYREC. Per 
Y <i RECSIZ - the length of a new record created internglly by subroutine KEYOUT 
F based upon the information supplied in array KEYREC. This record 
gontains a variable amount of diagnostic information based upon 
its lenggh. RECSIZ must be > 3 (suggested value is 6). PEG 


_  NUMBIT - 


the number of bits fn the hash address which is produced by a 
ps 
hashing function cglled -internally by subroutine KEYOUT (suggested 
value is 13). , : ‘ } 
HEAD - 


the name of a FORTRAN array containing control information for 


, 


the database (see Section 3.1.2), ° 


HEADL - theNength of the array HEAD. Currently,. the value of HEADL should 
be ll. 


F This may be expanded in the future if more control infor- 


mation is found to be desirable. ; 


The subroutine, KEYOPN, should be called before the first call to the sub- 5: 
routiné KEYOUT. : ; . 


~ 


& CALL KEYOUT 


it sends information sal the next keyword 
record (assemed to be in array KEYREC) to the KEYWORD file. 


Each time KEYOUT is called, 


( io 


i 


3.1.2 
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CALL KEYCLS 


This subroutine should be called when there are no more keyword records to 


pass to KEYOUT. 
: $ 


KEYWORD File Data Formats 
Array KEYREC should look like this upon entry to KEYOUT. 
KEYREC (1) 

KEYREC (2) 


: KEYREC (3) 


KEYREC (INLEN-3) 


KEYREC ( INLEN~2 ) 


KEYREC ( INLEN-1) 


KEY REC (LNLEN) Pear | MASK CITNUM 
f ly Cares Cis Saran ee eee 
7 bits 5 bits 24 bits 


FS : . 

Array elements KEYREC(1) through KEYREC(INLEN-3) inclusive are reserved 
for holding the Fieldata text of a keyword. The keyword must be left- 
justified and space-filled starting in element KEYREC(1). 

Example: = 


If you choose the maximum size of a keyword to be 60 characters (i.e., 


* KEYSIZ = 60), then INLEN = (KEYSIZ + 5)/6 + 35 13, and INLEN - 3 = 10. 


Therefore there are 10 array ebements available for holding a keyword. This 
is just enough because exactly six Fieldata characters fit in each array 

element for FORTRAN V implemented for a UNIVAC 1100 series computer. If the 
keyword contains fewer than KEYSIZ - 5 characters, the array element immedi- 


ately following the end of the keyword must contain six Fieldata spaces. 


This is very important! ts: 4 , ‘ iS Ft 
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=] . < ¢ ae! Byrd : N 
. : YEAR - 7-bit field which contains a binary number representing the last : 

° . 
two digits of the year of publication for a given citation. YEAR. - - : 
is primarily ‘useful for bibliographic databases. If it is mean~ 
, ; Tl asi for your database, just use zero. 


MASK - 5-bit. field that contains the bit settings which indicate that the 
4 ° 

; ‘keyword is to be used ‘as a main kejeoed or one of ‘the four | 

t auxtliary keywords. (The leftmost bit in‘a word (array cums 


is called bit 35. ) os + De 
a. : "bit 2821 - means main keyword. , RN : > 
' ‘pit 27=1- . means first auxiliary keyword. : 


. 
. . . 
. . ‘ 4 a 
a « 
x « : s 


_ bit 24 = 1 - means ‘fourth auxtliary keyword. 


a At least one bit must always equal i? However, it is ence 
: meaningful to have tore than one. bit equal 1. 


‘CTENUM - - 2h-bit field which contains the binary for the eitazion number. ~ 


- NOTE: © ‘The array element KEYREC (INLEN), ie broken into three odd=-sized 
pieces, 7 bits, 5 bits, and 24 bits. Those programmers who are not 
familiar with performing partial word manipulations in FORTRAN V “are 

'- advised to read about the FLD funct ion + oa 4 3.2 of, _the FORTRAN V 


"Compiler Reference Manual for the 1108.*, 


ae 


Hint: Put th nOEAeane six statements eee before ae first executable 


statement your HEOEE Ss 


PARAMETER’ KEYSIZ*= 60, mos eg 
"PARAMETER INLEN = = (KEYSIZ + 5)/6 +3 ~ ~ 
| DIMENSION KEYREC(INLEN) =, Veen at . 
vi ‘DEFINE YEAR FLD(0,7,KEYREC(INLEN)) 9. ! gS ae 
DEFINE MASK FLD(7,5,KEYREC(INLEN) )~ | ak ae 
DEFINE CITNUM PEO ERS REEL MENS a on ' 


a 
te 


ai] 


ae Then later on in. your program if you want to oe the YEAR field‘to 74 for 
! @ Sample, eis use the statement 


. YEAR =‘ 74,’ 
ee ts 


Pies 


' ’ 
« . 
2 @ ; ; 


Similarly, the atatement 


pe € os 


ae MASK = 1 
' ? e 
The statement” .* 


CITNUM = 9923 _ 


puts the value 9923 into the CITNUM field without changing the current ” 


values in the YEAR and MASK fields. Phe 


Note:: KEYREC(1) enough Seinicieeniss are changed by KEYOUT. Only 


KEYREC (INLEN) - remains unchanged. 
a = ey ’ i \ thie : 


“The array HEAD- should look like this upon entry to KEYOPN. 


6 bits 6 bits t 
fa NN : 


| can) 
¢ . 
i ae, ae Ne 
x : 4. 10 


Where: | i 


KWDHDR - the six “Fieldata characters: KWDHDR, 


KCNTRLN - the length of atray HEAD, -curren€ly ell. | . 


& DBNAME - six Fieldata characters specifying the-databdse rfame (you choose - 


it yourself). « . 


a | ; ” 


\ 


sets the rightmost bit*in thé’ MASK field and clears the other four bits. 


. 


wo 
ET > the binary representation for the number of bits in the hash add?®s 


(suggested, value- is 13). a - Po eS 


NUMAUX - the binary representation for the number of* ‘auxiliary, keywords « de- 
fined fdr this database (shouldbe between’ 0 and 4 Anclusive). : 


AUX1.- six Fieldata characters, left-justified, space-filled, for the 
special function that denotes auxiliary key #1. Use all Fieldata 


spaces if this word is not used. : 


p ‘ 


AUX2 - the same as AUX1 but for auxiliary key #2. 
AUX3 + the same as AUX1 but for auxiliary key #3. 
AUK4 - the same as AUX1 but for auxlliary key #4. 


‘RECSIZ - the length of the internal record produced by KEYOUT (suggested 
‘value is 6). : 


CITBAS - @he base value which is added to all the "relative" citation num- 

| bers referenced in this document in order to compute the actual 
‘citation number. The fields allocated for the,citation number in 
both the KEYWORD records and the Al records are 24 bits long. 
Therefore the highest citation number that can be placed into 
these fields is 274 - 1 = 16, 777p215, Most databases cs citation 
eae which range between numbers much smaller than 224 - 1. For: 
these databases we use CITBAS = 0, and we put the actual citation 
number into the citation number fields. Ce +0 = @)s see: ar 
Stephen C, Kleene, Introduction to Metamdthematics,’ (1950),-p. 186. 
If your database, starts with very largé citation numbers and the, 
largest numbers exceed 24 - 1, then you can still use WISE even 
though your citations nuabate won't fit“into a 24-bit field. Just 

a set “CLTBAS equal to the lowest citation af -l. Then call your 

first citation number 1, the second number 2, etc. , ie 


. The WISE system will always add CITBAS to compute the actual citation ~ 


° ¢ 
number. 


Note: CITBAS does not allow us to have tore than 2 aad 1 distinct citation 


numbers. It just allows us to have very large citation numbers. 


PREFIX = six Fieldata characters to-be printed just before the citation humber 
when additional information is not requested. For example, for the 
RIE database, PREFIX =4.,.ED" 


. ead ~ 


The calling ianieacis are as “follows.” * “f 
. : a 
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Writing the Al File x 


oNA set of three FORTRAN V callable subrout ines rr provided for writing the 
Al file. ; © : a 
ALOPEN (open the,Al ‘file), * : : : 
ALOUT , Xoutput AL records) St 

‘Sicis (qlose the AL, fie) 


a 


CALL AIOPEN(AHEAD,AHEADL}) = te eg 8 


ru ' < 
where: Pits a 


AHEAD - the name of a FORTRAN iy containisig control information for 
the database (see Section Jel. 4). 


’ 
ry 


AHEADL - the length of the areay HEAD. cheery the value of AHEADL 
* 2 e 


should.be’ 4. This may be expanded in the future if more control 
information is found to be desirable. 


a - 
‘ 


? ; 
CALL AIQUT{AIREC) ie 


he . 


ALOUT assumes that each time it is called a new AI record (see Section 


*, 3.1.4) has been placed into the array AIREC. Records should be presented 


to ALOUT in order of increasing citation number: 


CALL AICLS 


ve 


Sthis subroutine, should be called when’there are no more AI records to 
pass to ALOUT. 


. 


Al File Data Formats ; . i 


. ‘ ~ 


The array AHEAD should 106k like this upon entry to AIOPEN. 


ws 
AHEAD (4) | : 


| ao — 


bis 
§/1976 


. } 


_~ The array AIREC showld ‘look like this upon entry to AIOUT. 


where: ° 


+ 


AIHDR - the six Fieldata characters: ALHDRS a, 


ACNTRLN - the length of the array AHEAD, currently = 4. 


’ 


ANUMBIT - a value pertaining to an internal table used by the Satu program / 


to retrieve additional information (suggested value is 13). 


h 


AIREC(1); ', : 
AIREC (2) " 
AIREC(3) ° Text of field in Fieldata, 
left-justified, ee | 
. in WDCNT words. 
‘ 
a ® 4. 
. 
* 
where: \ , 


RECLEH- - a 12-bit- field which contains the binary representation for the total 


record length (in worgs) of this Al tecord. > 


‘ 
- 


CITNUM - a 24-bit field which contains the bidary representation for the 
‘citation nunber for this record. | 


» 


| i 

‘ Ps - 

‘(FTYPE and WDCNT comprise'a print field control word.) 
2 ie 


FTYPE = an 18-bit field containing the field |type in binary. 


WOCNT - an 18-bit field containing the numbe of text words in this figld 
, (it may be 0). , 


« 
The additional information records placed int@ the array *ALREC can ie vari- . 
able in length. The maximum length is a. - Pe 4095 words. 


U 


5 Er 


/ 


@. 


1.5‘ Rules for Coding and Mapping XX-READ 


° 
records. 
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The fields are arranged in the order they will be printed, not neces- 


sarily in the order they are found in the original records. The print 
fields may be composites or subsets of the* fields in the original data 


Therefore the field types may sometimes be internally generated,¢ ‘ 
(by XX-READ). 


~ 


“Printing conventions for AI records (see’ p. 2-3 for an example): 


The following printing conventions are followed by the print routine in 
the SEARCH program. 


The printer or terminal is spaced before printing the 
first line. 


The first line, consists of the prefix word and the citation — 
number. Field type 100 indicates the prefix word. This,consists of exactly 
six Fieldata characters which are to be printed on the first line immedi- . 


ately preteding the citation number. The citation number is printed in 18 ‘ 


format. The text of the next field prints on the line immediately following 


the citation number. The text of a new field always starts printing on a 
——— 4 


s very long, it will be broken: into as many lines as 
are necessary. to print all of it. any field in the: ‘Al record , including 
‘the prefix word may be omitted from an Al record. 


/ a % : . 


new line. If a fiel 


"1 All variables referred to in this document must be declared as type 


integer. / This can be.easily” accomplished by naing ‘the statement 
: oe sae INTEGER(A-Z) 


as the first FORTRAN ‘statement in XX-READ. 


’ e 


‘2), If youn XX-READ program contains any READ statements, it must perform - 


- ail of Hts READ's before before cthe first first fall to KEYOUT. The most likely 
reason ffor doing a READ would probably be to find out the number of 
input ape reels. 


3) Assum g that you are working with a ‘file called A, the following two , 
* statements will properly map XX-READ for you. 
,S NEWWISE, XX-READ-MAP, A. XX-READ am 
ti XX-READ : 


You must substitute your own program name! for the underlined occurrences 
a of XX-READ above. 


‘ 


@..; dy BOs ~ ps 
" -1.6 Debugging Aids for XX-READ ; 


’ 
: 


MACC has developed some utility rout ines which are useful for examining 


the contents of the, two tape files produced by -XX-READ. * these are available 
on an informal basis from the MACC staff member responsible ‘or WISE, 


1o 


* The Rules for XX-READ 
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Chapter 4 @® 


‘INITIAL DATABASE CREATION 


’ 


Atythis point, we assume that the appropriate XX-READ program has been 
coded, “mapped and debugged. This portion of the document deals with the 


initial creation of a catabane (see Figure 1). ‘ a | 


t \ 
* x 

“The ides is a sample ronstxesn for an XX-READ execution, dssuning 
that the\ original Storey tee source is a set: off 7-track tapes. XX-READ 7 
is astm to be in file A. E , ; \ 
: er tal| 
@RUNL/R runid,project,user, dollar limit \ 


r 


ASG, . 7.,1,<tape reel ID's SOnnestOHnTna” to the original: ifort 
, source. tapes> 


@ASG, TN 8.,1,<tape ree] ID's for i to receive the KEYWORD Hite» 
@ASG,T 3: »1,<tape reel ID's for tapes to receive the AI file>|' 
@ASG, of XB. »F/1800/TRK/2149 | 
@ASG,T XA. »F/250/TRK/ 307 : | 
¥@XQT,<options> A.XX-READ - a , 
. <Data images for: any "READ! s performed prior to quilting KEYOUT> | 


\ 
The tape reél ID's for units 8. and 9. must be preceded by 3! al) 
because they are output tapes. 


option is specified on the @XQT statemente 


"FIN . a 5. 


Note: 
‘Full 2400 foot tape reels must be 
used for file 8: Short. reels cause XX-READ to terminate in error. 
The files XB. and XA. are used by the ‘system ceckl ietn the (called 
internally by KEYOUT). The file sizes were co ited for RECSIZ < <6 


. t. 
_ Summary of ae stay ; ? 


NONE - Tie oufput bane files are produced in their tandard format. 


te @ 


E-A software ‘EOF (end of file) Fagdra is not ee on the last * 
‘tape of the AI file. This is: used when an initial database: 
creation must be split into several executions (e.g., because 

of elapsed time consideration). In this case, all executions 
“except the last one shduld use the E XQT option. ® 


» 49 


« 


4.2 Using MERGE 


Pad 


. 
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\ ; 5 7 
H - The Al\headet regord is not written when ALOPEN is called. This is 


also necessary when multiple executions are required. Every execu- 


. tion extept 


P - If the B opti 
subroutine 
to optim}ze i 


e first one should use the H option. 


is specified, the system sort routine (called by , 
OUT) reads information from the runstream in ay,attempt 
sorting. “(See Section 4.5 of the MACC manual, File 


(FILE 8. above). A bias value of 1.9 is recommended. 


creation, then the MERGE prggram is not necessary. ’ The single, sorted 
keyword tape may, be directly nput to the BUILD program. However, if 


- before input: to BUILD. This merging is necessary because each individual 
tape is the result of sorting only part of the total ‘set of keyword records. 
MERGE is coded to \perform two-way merges.’ Therefore, if n tapes were pro- 


duced, then n - 1 MERGE runs are necessary to get them all merged. 
a’ ‘ 


* \ ny 
You ¢an sde from\Figure 2 that MERGE can read files written both by . 
* *. XX-READ and oni as runs. = S 


. 


OLDB sas ; 


.* 
‘ 
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/ 
. 
lst | 
MULTI-=REEL 
K \ MERGE ; 
, FILE 
\ 
i 
*» 
d 
¢ | - 
minctaea _—_ ; > 
. ¥ cm 
Ww 
s a 
* f 
21 . é 
’ ” Figure 2. Merging 


. i ‘ ry 
x J 
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@.. The*Runstream for MERGE 


4.3 Using BUILD 


‘ 


_ The Yel lawns should do the trick. { ? 


@RUN, JR runid, project,user,do] lar limit . 

@ASG,T 7.,1T,<tape reel ID's far first inpu file> 
: @ASG,T 8.,1,<tape reel ID's- "for second input file> 
@ASG,T 9.,1,<tape reel ID's “for output file> of 
@XQT “NEW*WISE. MERGE : 


QFIN. e . | i 


s 


During an initial creation, BUILD reads KEYWORD tapes written by MERGE 
(or XX-READ) and AI tapes written by KX-READ. It then creates a two part 
database which it nicely dumps onto two multi-reel tape files: : 
1) citation number retrieval structure, - 


2) AI retrieval structure. 


The input Al Ganka may be missing, depending on the XQT options specified. 


Of coursey input tape files must be consistent with the options sppeified. 
: Se : 


XQT Options: | J 


-#None - inifial build, only keyword tapes are expected. 


e \ 


U (alone) - update, only keyword tapes are \expected. 


A - Al tapes are also expected. ‘ vy 
H - bypass AI header records found on AI tapes othi r than the first 
tape. This is for people who forgot to use the H option on the 


appropriate XX-READ executions. Without the H ption, two 


¢ header records in the samé AI file is considered an error and 


BUILD will perform an error stop because the wrong, tapes are 


. { 
i probably mounted. ~ 


f 
t 
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nie ubatteae for BUILD (Initial Creation) 


The following runstream is|appropriate for an initial database creation 
where both keyword tapes and Al tapes are expected as input. 


@RUN,/R runid,project,user, dollar limit 


@ASN,T 7.,T ,<tape, set 1>/<tape set 2>/<tape set 3> | | . 

@XQT,A NEW*WISE. BUILD | ( 
\@FIN 

\ 


a 
<tape bet l> =a set of tape reel ID's representing the output from MERGE 
‘ \. (or XX-READ) - 


_ <tape set 2> = a set “of tape reel ID's representing the multi-reel sg tape 


file produced by XX-READ 


<tape set 3> = a set of tape reel ID's represent ing output tapes that will 
receive: 


23 1) the ore oe ‘cake retrieval tape file 


2). the multi-reel Al retrieval tape file. 


In general it is a good idea to supply a large number of tapes in tape 


set 3. ilies Starts writing the Rader eee citation number retrieval struc- 
ture on ‘the first oytput tape. 


dump the whole structure. 


It uses as Many tapes as are necessary to 


. 


When this process is done, it prints the message: 


EOF RECORD WRIPTEN ON TAPE REEL # n . 


where n is the ne 


output reel (by this means, you know where the multi-reel 
file ended). 


In the example above, an Al retrieval structure is also to be 


written, so BUILD will, s@tch to the next output tape and start dumping the 


AI structure. When this process is done you get another EOF message. The 


value of n is to be interpreted to be the art tape in tape set 3, not the 
th 


tape within a multi-reel file. 
’ 


ad 
(5-1] 
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RUNNING SEARCHES ON NON-MACC DATABASES 
“ 


‘ 


: Databases,not maintained by MACC are referred to as non-standard databases 


internally to the SEARCH program). r 


how tO load a non-standard database. | “= ! 


@ASG,T 7.,T,10615/10616 |‘ 

READY 

@NEW*WISE.SEARCH,N YOUR*DATABASE. 

SEARCH - VERSION 01.02 ATE:042676 TIME: 170447 
DQ YOU WANT TO PAY TO LOAD’ THE DATABASE? YES 


DO YOU ALSO’ WANT TO LOAD, RHE ADDITIONAL INFORMATION? 
DO YOU WANT TO CATALOG THE DATABASE? YES > 


/ .PARDON ME WHILE I POLGGGr MY MEMORY 


DATABAS& LOADED 


(non-standard in the sense that the tapes on which~they reside are not known 


e following computer output demonstrates 


YES 


TO OBTAIN INFORMATION ON HOW TO USE THIS ‘PROGRAM, INPUT: 


/HELP 


/HELP 
NEWS 


FOR COMMENTS OF GENERAL INTEREST TO WISE USERS, AEDS 


ACTION 


_ tHIGHER EDUCATION 
SET 1 410 ENTRIES 


. 
ue ‘ : 


:/QUIT 


END SRARCH 
@SAVE,S YOUR*DATABASE. 
MACC SAVE PROCESSOR VERSION 6.0406 


YOUR*DATABASE( 1) TO BE SAVED INDEFINITELY . 
END SAVE. 
_ @FIN - ' 


Y 


There are two important things to remember when loading a non-standard 
database: . s 
’ 


~ 3 


1) “A tape assign statement for unit 7., must precede the @NEW*WISE.SEARCH,N 


processor call statement. — 


Thé SEARCH program must be called with an N option as shown above. 
_ This tells SEARCH that it is aOnCIG non-standard database. 


2) 


a. — 


’ 


: é 571 The Tape As 


75.2 
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The general form of the tape assign statement is: 
@ASG;T 7.,<tape set 1>/<tape set 2> 


Ktape set l> is a set of reel ID's corresponding to a multi-reel citation 
number retrieval tape file (the first output file from BUILD). + ' 


<tape set 2> is 4 set of reel ID's corresponding to a multi-reel AI re- 
trieval tape file (the second out pat file from BUILD). This set may be 
absent if you don't want to load the Al file. , 

In the above example, tape set 1 consists of exactly one tape, 10615. 
Tape set 2 consists of ex&ctly one tape, 10616. 


The Processor Call a . : by 


The general form of cg processor call statement ! for non-standard data- 
bases is : 


, © @NEW*WISE.SEARCH,N FILENAME. 


° - 
or 


~ 


@NEW*WISE.SEARCH,N QUALIFIER*FILENAME. 


oo? 


The final period is mandatory! FILENAME. and QUALIFIER*FILENAME. are the 


two most common ways of specifying an EXEC-8 filename. (See the MACC 


Computing Handbook, Section 3.2, for EXEC-8 file naming conventions.) 


le with the name specified 
on the processor call statement, and either c talogs ,it (PUBLIC, READ-ONLY) or 
makes it a temporary file. 


During an initial load, SEARCH creates ay 


(This is dgtermined by- the user's response to 
the question "DO YOU WANT TO CATALOG THE DATABASE?".) It then loads the 


database ‘from the tapes on unit 7 and internally @FREE's the tape file. 


. SEARCH then enters ACTION mode and you may start searching as described 


dn ne WISE Toe progres doniusialits ; " > 


If you want-your file to 
be saved you must do it yourself with the ‘@SAVE command. (See the MACE 


Computing Handbook, Section 3.9.2, and the section on @SAVE in the blue 
pages.) 


SEARCH does not SAVE the file it catalogs. 


| , SY 
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A note of caution: WISE database files are cataloged with position 
Beanibayttys If your file exceeds 23 positions, MACC will not be able to 
actually save your file for you. However, the @SAVE processor will not 
notify you of this condition. Although the situation is'not very likely 
because 23 positions represents an extremely large file, it is nevertheless 


a good idea to check the size of your file with the statement:- 


@PRT,F YOUR*DATABASE. 


If you have more than 23 positions, contact MACC in order to make:your 
file a MACC standard database. This will solve the problem. 


The following is\an example of accessing a non-standard database that 
was previously loaded and cataloged. 


nN 


@NEW*WISE.SEARCH,N YOUR*DATABASE. 


SEARCH - VERSION 01.02 DATE: 042676 TIME: 171310 
, TO: PET pen INFORMATION ON HOW TO USE THES PROGRAM, INPUT: 
/ HELP 
FOR COMMENTS OF GENERAL INTEREST TO WISE USERS, INPUT: 
/ HELP 
NEWS 


ACTION 


:HIGHER EDUCATION 
SET 1 410 ENTRIES 


. t INSTRUCTIONAL MATERIALS. 
SET 2° 243 ENTRIES 


2182 Mi 
er 3 g ENTRIES ; 
:/QUIT a a he : , , 
END SEARCH / As Be aa Soa 
@FIN / 
/ 
/ . 
/ 
/ 
- / , 
Be SP i 
jo 
26 
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: oy Chapter 6. 
& a a “ARARASE UPDATE _ 
a a nm oe os - 4 
sree Figure, 3_is an overview of how to perform a database update. The opera- : 
* tion of XX-READ and MERGE, should be obvious ‘from the -dfagram. “The runBt reams ~ 
: should be about the s as those’ already shown for initial datsiirane creation. 
i The only program, that_ operates differently during an’ update is BUILD. ; bs 


oes eee “Basically the ‘differences are: i 
i: ’ 


1) aUL must be called with a U XQT spiders (and others if appropriate). 


2) The. cane ‘assign statement for — 7 is different. 


6.1 “The Runstrean for BUILD (Database, Update)~ 


The following runstream would be appropriate ‘for updating a database get 
that ‘eontained both a citation number retrieval tape file and an AI re- 
trieval tape file. . i ; . : — i 
; @RUN, /R_runid, project, user, digi'lax limit _— . 
@ © As6,t 7.37,<tape set 1>/.../<tapeset 4> “Oe i 
RE @XQT,UA NEW*WISE.BUILD re i ; ; Pe 
@FIN @e | r 


a 
% 
hy 


<tape, set 1> = the multi-reel sorted keywoéd tape file from the update 


MERGE’ run. (output from MERGE in Figure 3). 


‘ <tape set 2>.= 


. 


the: multi-reel Al retrieval tape file: that was created 
by BUILD. during the ptévious update (or initial pie Bie 


<tape set: 3> = the mults-reel AL tape. file etnatad by XX-READ during” 


te * the’ current dail process. 


* <tape set 4> a set of tata to receive the new (updated) multi-reel 
citation number retrieval tape file, and the new (updated) 
multi-reel AI retrieval tape file. , The out pute 8 are 


® © ° processed exactly as in an initial database exeecten: 


When you want to load the new database that wal” written on tape set 4 


: e 1) Delete the old database (@DELETE,C YOUR*DATABASE. will work); 


2) “Use the SEARCH program to load the updated database from the new tapes. 
: 2 é : a 


; eee Poe 
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INFORMATION 
SOURCE 
UPDATE 


LAST SORTED . 
KEYWORD TAPE 


'PIRST SORTED 
KEYWORD. TAPE 


MULTI-REEL 
SORTED KEYWORD 
TAPE FILE FROM 
PREVIOUS UPDATE 


MULTI-REEL 
AI RETRIEVAL _ 
TAPE FILE FROM 

PREVIOUS UPDATE 


MULTI-REEL 
CITATION # 
RETRIEVAL 
TARE FILE 


- Figure 3 — ONLINE DATABASE UPDATE 
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